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Abstract 

We study how a behavior (an idea, buying a product, having a disease, adopting 
a cultural fad or a technology) spreads among agents in an a social network that 
exhibits segregation or homophily (the tendency of agents to associate with others 
similar to themselves). Individuals are distinguished by their types (e.g., race, gender, 
age, wealth, religion, profession, etc.) which, together with biased interaction patterns, 
induce heterogeneous rates of adoption. We identify the conditions under which a 
behavior diffuses and becomes persistent in the population. These conditions relate 
to the level of homophily in a society, the underlying proclivities of various types 
for adoption or infection, as well as how each type interacts with its own type. In 
particular, we show that homophily can facilitate diffusion from a small initial seed of 
adopters. 
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1 Introduction 

Societies exhibit significant homophily and segregation patterns. 1 How do such biases in 
interactions affect the adoption of products, contagion of diseases, spread of ideas, and other 
diffusion processes? For example, how does the diffusion of a new product that is more 
attractive to one age group depend on the interaction patterns across age groups? How does 
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1 For background on homophily and some of its consequences, see McPherson et al. (2001) and Jackson 
(2008). 



1 



the answer depend on the differences in preferences of such groups, their relative sociabilities, 
and biases in the interactions? 

We answer these questions by analyzing a general model of diffusion that incorporates 
a variety of previous models as special cases, including contagion processes studied in the 
epidemiology literature such as the so-called SIS model (e.g., Bailey 1975, Pastor-Satorras 
and Vespignani, 2001), as well as interactions with strategic complementarities, such as in 
the game theoretic literature and network games (e.g., Galeotti et al, 2010). 2 Our model 
incorporates types of individuals who have different preferences or proclivities for adoption, 
as well as biases in interactions across types. 

In particular, we examine whether or not diffusion occurs from a very small introduction 
of an activity in a heterogeneous and homophilous society. We first concentrate on the focal 
situation with only two types of agents. Within this case, the most interesting scenario turns 
out to be one where one type would foster diffusion and the other would not if the types 
were completely segregated. In that scenario, we show that homophily actually facilitates 
diffusion, so that having types biased in interactions towards their own types can enhance 
diffusion to a significant fraction of both types. Having a higher rate of homophily, so that a 
group is more introspective, allows the diffusion to get started within the group that would 
foster diffusion on its own. This can then generate the critical mass necessary to diffuse the 
behavior to the wider society. In contrast, societies exhibiting less homophily can fail to 
foster diffusion from small initial seeds. 

We then move to the general case of many types. Our main characterization theorem 
generalizes the features from the two-agent case, showing that diffusion relates to a condition 
on the largest eigenvalue of an interaction matrix which tracks the initial adoption rates of 
various types of individuals, that is, their adoption rates from small initial seeds. Again, we 
show that homophily can facilitate diffusion, showing that a sufficient condition is that some 
type (or group of types) that would adopt on its own is sufficiently homophilous to give the 
diffusion a toehold. We discuss how this extends the intuitions from the case of two types. 

2 An Illustrative Example: The Heterogeneous SIS 
model with Two Types 

To fix ideas and preview some of the insights from the general model, we begin with a case 
where there are just two types of agents and the contagion follows a simple and well-studied 
process. 

In particular, consider an infectious disease spreading in a population with two groups: 
the young and the old. Our aim is to analyze whether or not diffusion of the disease occurs. 
That is, if we start with a small seed of infected agents, will the infection spread to a sig- 

2 For background on diffusion in networks see Newman (2002), Jackson and Yariv (2005, 2007, 2010), 
Lopez-Pintado (2006, 2008, 2010), Jackson and Rogers (2007) among others. 



2 



nificant fraction of both populations and become endemic? In order to answer this question 
consider the following heterogeneous version of the canonical SIS model. 3 

Agents can be in one of two "states" : infected or susceptible. A susceptible agent becomes 
infected at an independent probability v > from each interaction with an infected agent. 
Conversely, with a probability 5 > per unit of time an infected individual recovers and 
becomes susceptible again. 4 The crucial parameter of the model is the relative spreading 
rate, A = |, which measures how infectious the disease is in terms of how easy it is to 
contract compared to the rate at which one recovers. 

An interesting case for our analysis is one where the population is heterogeneous in terms 
of the proclivities for getting infected. In particular, imagine that the older are more (or 
less) vulnerable to the disease than the young. More precisely, if Ai is the spreading rate of 
the young and A 2 of the old, then we allow Ai ^ A 2 . 

In addition to their age, individuals are also potentially differentiated by the rates at 
which they interact with other individuals, where "interact" is taken to mean that they have 
a meeting with an individual which could transmit the infection if one of them is infected 
and the other is susceptible. In particular, apart from his or her type, each individual is 
characterized by a degree d; the number of agents the individual meets (and is potentially 
infected by) every period. Let Pi(d) be the degree distribution of individuals of type i; that 
is, the fraction of agents of type i that have d meetings per unit of time. 

Also, for the purposes of this example, we stick with what is standard in the random 
network literature, and take the meeting process to be proportionally biased by degree. Thus, 
conditional on meeting an agent of type i, the probability that he or she is of degree d is 
p-, where (d)i is the average degree among type i agents ((d) i = ^2dPi(d)d). 

To capture homophily, let < n < 1 be the probability that a given type i agent (old or 
young) meets his or her own type, and 1 — tt be the probability of meeting an agent of the 
other type. For example if the populations are of even size, then having tt > 1/2 means that 
agents are mixing with their own type disproportionately. 

We say that diffusion occurs from a small seed (with a formal definition below) if starting 
from an arbitrarily small amount of infected individuals (of either type), we end up with a 
nontrivial steady-state infection rate among the population. 

Let TT = - l Z ^iAirf2A 2 _ h ^ = g>£_ 

Theorem 1 Diffusion occurs from a small seed in the two type SIS model if and only if one 
of the following holds: 

2) A1A2 < tV and ir > n 

7 did 2 

3 Thc so-called SIS (Susceptible-Infected-Susceptible) model is a basic one used by the epidemiology 
literature to describe such situations (e.g., Bailey 1975, Pastor-Satorras and Vespignani, 2000, 2001). 

4 The SIS model allows a recovered person to catch the disease again. An obvious instance is the standard 

flu. 
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The proof of the theorem appears in the appendix, and is a special case of our more 
general results below. 

The condition for diffusion in the standard (homogeneous) SIS model is A > i- (e.g., 
Pastor-Satorras and Vespignani, 2001). Thus, we see how this generalizes in the above 
theorem. 

Theorem 1 yields the following straightforward consequences. 

Corollary 1 The following statements hold for the two-type SIS model: 

1) If diffusion occurs within each type when isolated ( when 7r = 1 ), then it would also 
occur when there is interaction among the two (when n < 1). 

2) If diffusion does not occur in either of the types when isolated, then it would not occur 
when there is interaction among the two. 

3) If diffusion occurs among one type but not the other when isolated, then it will occur 
among the whole population if the homophily is high enough. 

The most interesting scenario turns out is the last one, such that one of the types would 
foster diffusion if isolated, whereas the other would not (i.e., Ai > and A2 < --r). In that 

Ol (12 

scenario, homophily either plays no role (that is, when A1A2 > tV) so that any homophily 
level will allow diffusion, or else it actually facilitates diffusion (that is, when A1A2 < tV i n 
which case it must exceed 7r ). 

In the latter case diffusion occurs only if the two types are sufficiently biased in interac- 
tions towards their own types (i.e., it is sufficiently large). The intuition for such a result 
is the following. Having a higher rate of homophily, so that a group is more introspective, 
allows the diffusion to get started within the group that would foster diffusion on its own. 
In turn, it can then spread to the wider society. 

3 The General Model 

With this introduction behind us, we now describe the general model. 
3.1 Types and Degrees 

Each agent is characterized by his or her degree d > and type i G T = {1, m}. 

Since the number of individuals of each type can differ, let n(i) be the fraction of indi- 
viduals of type i. 

An agent's degree d indicates the number of other agents that the agent meets (and is 
potentially influenced by) before making a decision in a given period. The meeting process 
is allowed to be directional; i.e., agent h meeting (paying attention to) agent k does not 
necessarily imply that k pays attention to h. So, although we use the term "meeting," the 
interaction need not be reciprocal. Of course, a special case is one where the interaction is 
mutual. 
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Different types may have different distributions in terms of how frequently they meet 
other agents. In particular, let Pi(d) be the degree distribution of individuals of type i. That 
is, Pi(d) is the fraction of type % individuals who have d meetings per period. Thus, there 
can be heterogeneity among agents of a given type, in terms of how social they are. 

An agent's type i shapes both the agent's relative interaction rates with other types 
of agents and the agent's preferences or proclivity for infection. In particular, ir^ is the 
probability that an agent of type i meets an agent of type j in any given meeting. Clearly, 

m 

^^7Tjj = 1. The bias in meetings across types is then summarized by the matrix 

3=1 



( 



n 



TTll 



\ 
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We assume that II is a primitive matrix (so that 11* > for some t). This ensures that 
there is at least some possibility for an infection that starts in one group to reach any other, 
as otherwise there are some groups that are completely insulated from some others. 



3.2 The Random Meeting Process 

In order to study this system analytically, we examine a continuum of agents, N = [0, 1]. 

This continuum is partitioned into agents of different types, and then within types, by 
their degrees. 

There are two ways in which the meeting process can be biased: by type and by degree. 

In particular, as mentioned above, the relative proportion of a type % agent's meetings 
with type j is described by the term 7Ty, which captures relative biases in meetings across 
types. So, in a given period, an agent of type i with degree d expects to meet ditij agents of 
type j. Those agents are randomly selected from the agents among type j. 

We also allow the meeting process to be biased by degree. The probability that an agent 
meets an agent of degree d out of those of type j is given by 

Pj(d)w,(d). 

If there is no weighting by degree, then an agent equally samples all agents of type j and 
Wj(d) = 1. This would require a directed meeting process, such that an agent observes 
members of a given type uniformly at random, independently of their meeting process or 
sociability. If instead, meetings are proportional to how social the agents of type j are, then 
Wj(d) = dj (d)j, where (d)j is the average degree among type j agents. This latter condition 
covers cases in which meetings are reciprocal. 5 

Our formulation also allows for other cases. For simplicity, we assume that Wj(d) > for 
all j and d such that Pj{d) > 0. 

5 For some details and references for random meeting processes on a continuum, see the appendix of 
Currarini, Jackson and Pin (2009). 
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3.3 The Infection or Adoption Process 

In each period an agent is in one of two states s G {0, 1}. Either the agent has adopted the 
behavior and are in state s = 1 (active, adopted, infected...), or they have not adopted the 
behavior and are in state s = (passive, non-adopter, susceptible...). The agents' actions 
are influenced by the actions of others, but in a stochastic manner. 

Agents are heterogeneous with respect to their proclivities to adopt the behavior. A 
passive agent of type % adopts the behavior at a rate described by a function fi(d, a), where 
d is the agent's degree (number of meetings per unit of time) and a is the number of agents 
whom she meets who have adopted the behavior. (Details of the dynamics will be given 
below.) The reverse process, by which an active agent of type i becomes passive happens at 
a rate described by a function g^d, a). The functions fi(d, a) and gi(d, a) are the primitives 
of the diffusion process and are assumed to satisfy some basic conditions: 

Al fi(d, 0) = for each i and d: a passive agent cannot become active unless she meets at 
least one active agent. 

A2 fi(d, a) is non-decreasing function in a: the adoption rate is non-decreasing in the num- 
ber of active agents met. 

A3 fi(d, 1) > for each % and some d such that Pi(d) > 0. This condition implies that for 
each type of agent there exists some degree such that the rate of adoption for agents 
with such a degree is positive when they meet at least one active agent. 

A4 gi(d, 0) > for each i and d: it is possible to return from active to passive when all 
agents met are passive. 

A5 gi(d, a) is non- increasing in a: the transition rate from active to passive is non-increasing 
in the number of active agents met. 

This general model of diffusion admits a number of different models, including models 
based on best-response dynamics of various games (with trembles) as well as epidemiological 
models. Here are a few prominent examples of processes that are admitted: 

• Susceptible-Infected-Susceptible (SIS diffusion process): fi(d,a) = u^a and gi(d, a) = 
5i, where Vi > and 5i > 0. 

• Myopic-best response dynamics by agents who care about the relative play of neighbors 
(Relative Threshold diffusion process): fi(d,a) = z/j if | > q and fi(d,a) = 
otherwise. Also gi(d,a) = Si if | < q and gi(d,a) = otherwise, where z/j > and 
6i > and q e [0, 1]. 

• Myopic-best response dynamics by agents who care about the aggregate play of neigh- 
bors (Aggregate Threshold diffusion process): fi(d,a) — if a > mh\[q, d] and 
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fi(d,a) = otherwise. Also, gi(d,a) = Si if a < q and gi(d,a) = otherwise, where 
u t > and 8 { > and q > 0. 6 

• Imitation dynamics when a neighbor is chosen uniformly at random (Imitation dif- 
fusion process): a) = vA and gi(d, a) — — , where i/j > and <5j > 0. 



3.4 Steady States and Dynamics 

In order to keep track of how diffusion or infection occurs, we analyze a continuous time 
dynamic, where at any given time t > the state of the system consists of a partition of the 
set of agents in "active" and "passive." 

As is standard in the literature, we study the continuous system as an analytically 
tractable alternative to the stochastic discrete system. 7 

Let Pid(t) denote the frequency of active agents at time t among those of type i with 
degree d. Thus, 

d 

is the frequency of active agents at time t among those of type i, and 



is the overall fraction of active agents in the population at time t. 
The adoption dynamics are described as follows: 

= -pjfiratetf*® + (1 - M^rcrie^t), W 

where rate®~f l (t) is the rate at which a passive agent of type i and with degree d becomes 
active, whereas rate^ l (t) stands for the reverse transition. In order to compute these 
transition rates we must calculate first the probability that an agent of type i has of sampling 
an active agent. Denote this probability by r p i {t). It is straightforward to see that 

hit) = *n E PM^MPjA*)- ( 2 ) 

3 d 

Given 'p i (t) then 

d 

rate°J\t) ^ffaa) Qft(t)"(l-ft(t)) M 

a=0 



6 In order to satisfy [A3] in this case, it is necessary to have some probability of degree 1 agents for each 
type, or else to have q = 1. 

7 See Jackson (2008) for discussion of what is known about the approximation. 
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and 

d 

rate}f(t) = j>(d, a) - ft(«)) (d_o) - (3) 

a=0 

A steady-state is when df>1 ^ = 0, which implies that we can write the steady state 
level Pi d(t) as being independent of time. Solving from equation (1) leads to the following 
necessary condition 

rate®^ 1 

Phd = rate^ + rate}^ (4) 

If we specify the rates Pj(t) for each type i, then this determines the rates of transition 
under (3). This in turn, leads to a level of p id for each i, d under (4) that would have to 
hold in equilibrium, which in turn determines the rates at which active agents would be 
met under ft(i). Thus, replacing equation (4) in equation (2) we find that a steady state 
equilibrium corresponds to a fixed point calculation as follows: 

Pi = Hi(Pl---,Pn)> ( 5 ) 

where 

rate 0_s>1 

H i (p 1 ,...,p n ) = Y, t« P M) W M) ' rate o^i +''ate 1 7° 



The previous system of equations implicitly characterizes the steady states of the dy- 
namics, since by solving for 'p i we can easily find the fraction of adopters of each type p i and 
ultimately the overall fraction of adopters p. 



3.5 Diffusion or Contagion from a Small Seed 

We now consider the following question which is the central focus of our analysis: If we 
start with a small fraction of adopters, would the behavior spread to a significant fraction 
of the population(s)? In other words, we determine the conditions that lead to the diffusion 
of a new behavior to a significant fraction of the population when there is a small initial 
perturbation of an initial state in which nobody is infected or has adopted the behavior; so 
starting from (ft, ... , p n ) = (0, . . . , 0). 8 _^ 

Thus, in what follows we explore the behavior of the system of (5) near p — ; in order 
to see conditions under which it is a stable steady-state. 

The system of equations described in (5) can be approximated by a linear system in the 
neighborhood of p = as follows: 

p = Ap 

8 Notice that the question of moving away from all 1 is completely analogous, simply swapping notation 
between and 1 throughout the model. 



where 



I FIT,. \f> 



A 
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dH, 



dp 



~ P=0 I 



As we show in the appendix, filling in for the expressions of ^^|p = o, we can rewrite A as 



/ 
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TTllXi 
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where 



Xi 



The term Xi is a nicely interpretable factor. It is the relative growth in infection for due 
to type i, but adjusted by the relative rates at which type i's will be met by other agents 
(so weighted by degrees according to Wi(d)). 

Note that if when we start with some vector of p^-'s near (so our approximation is 
correct), but with positive entries, and then we end up with a new vector that is at least as 
large as the starting vector, then it must be that is an unstable solution. 

Definition 1 There is diffusion from a small seed if and only if for any small e > 0, there 
exists some v such that < v j < e for all i and Aw > v. 

Thus, diffusion from a small seed requires that beginning any small fraction of initial 
adopters the "dynamics" lead to a larger fraction of adopters. 

We remark that if is unstable relative to some small initial seed v > 0, then it is 
unstable relative to any small initial seed v > 0. That is, if A~v > v, then for any v > 
there is some t such that A t: v > v. Furthermore, if there is no diffusion with a particular 
small initial distribution, then there will be no diffusion with any other initial distribution. 
The next Lemma formalizes such argument. 9 

Lemma 1 The condition for the diffusion from a small seed is independent of the distribution 
across types of the initial seed. That is, if Av > v for some v > 0, then for any v > there 
is some t such that A l \ > v. 



9 This result is partly an artifact of the continuous model approximation. For an analysis of the importance 
of the specifics of initial adopters, see Banerjee, Chandrasekhar, Duflo and Jackson (2011). 
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4 Analysis 



4.1 Two Types 

We begin with the analysis of two types, which is a generalization of the results in Section 
2. 

For now, we stick with a setting where tth = H22 — TTj so that there is a symmetry in 
how introspective groups are in terms of their meetings. 

Theorem 2 Let ttq = , }~ X 1T . ■ Diffusion occurs if and only if one of the following 
conditions hold: 

1 ) X1X2 > 1 or 

2) X1X2 < 1 and 71 > 7r . 

The proof of Theorem 2 appears in the Appendix. This result generalizes what was 
found for the heterogeneous SIS model presented in Section 2. The next corollary presents 
straightforward consequences of it. 

Corollary 2 In the two-type setting 

1) If diffusion occurs within each type when isolated, then it would also occur when there 
is interaction among the two. 

2) If diffusion does not occur among either of the types when isolated, then it would not 
occur when there is interaction among the two. 

3) If diffusion would occur among only one of the types when isolated, then it would occur 
among the entire population if homophily is high enough. 

To see Corollary 2 first note that if there is only one type of agent in the population 
then the condition for diffusion established by Theorem 2 reduces to the standard condition 
of a; > 1. Therefore, diffusion occuring within each type when isolated corresponds to 
having x\ > 1 and X2 > 1. Those conditions in turn establish part 1) of the corollary as a 
consequence of part 1) of Theorem 2. If, on the contrary, diffusion does not occur among 
either of the types when isolated, then x\ < 1 and X2 < 1. Straightforward calculations 
show that then the condition for diffusion stated in part 2) of Theorem 2 cannot satisfied 
for any value of n G (0, 1). The last part of the corollary follows vacuously if X1X2 > 1, and 
otherwise diffusion occurs if tt exceeds 7To, establishing the claim. 

4.2 The General Case with Many Types 

Consider the following matrix A: 

( n u Xi . . . vr lm x m \ 



.4 
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We remark that since x\ > for all i (under our assumptions A1-A5), and since IT is 
primitive and nonnegative, it follows that A is primitive and thus A 1 > for some t. 

We can now state the following result, which generalizes the two-type result to many 
types. 

Theorem 3 Diffusion occurs if and only if the largest eigenvalue of A ( denoted by \i) is 
larger than 1. 

The proof of Theorem 3 appears in the appendix. 
Corollary 2 generalizes to the m-type presented next. 

Corollary 3 1) If diffusion from a small seed occurs within each type when isolated, then 
it would also occur when there is interaction among types. 

2) If diffusion from a small seed does not occur for any of the types when isolated, then 
it would not occur when there is interaction among them. 

3) If there is some type for which ir^Xi > 1, then there is diffusion from a small seed. 

4) If there is a subset of types S C T such that J2jes ^ij x 3 > 1 f or eac ^ * e then there 
is diffusion from a small seed. 

We first explain why 1) holds, as 2) is a simple variation. If diffusion occurs within each 
type when isolated then Xi > 1 for all i and therefore 

f 7Tu ... n lm \ 

a> ; ... ; 

\ ^ml • • • "ram ) 

It follows that the largest eigenvalue of A is larger than 1 (since the right-hand side matrix 
is a stochastic matrix and thus has a largest eigenvalue of 1), and the result then follows 
from Theorem 3. 

Next let us explain why 3) and 4) are true, and then discuss the intuition. 3) is clearly 
a special case of 4), so let us discuss why 4) is true. Given that ^2j e s' K v x j > ^ ^ or eacn 
i G S, it follows that for any positive vector u: [Au]i is greater than mmj £S Uj for each 
i G S. Therefore, minj e s[Au]j > mhij^sUj, and so it must be that if u is the eigenvector 
corresponding to the maximum eigenvalue, 10 then Au > u and so the eigenvalue is larger 
than 1. 

1) and 2) of the corollary are fairly intuitive results. Note that in case of just one 
population, then x« > 1 is the condition that characterizes instability of (diffusion from) no 
activity. Thus, if all populations are such that they would experience diffusion from a small 
seed if isolated, then regardless of the interaction pattern there will be diffusion; and similarly 



10 Again, recall that A is primitive and thus has a strictly positive eigenvector corresponding to its largest 
eigenvalue. 
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if none of them would experience diffusion in isolation, then there cannot be diffusion when 
they interact. 

The less obvious cases are 3) and 4), which show that if some type or group of types 
has enough interaction with itself to get diffusion going, then diffusion among the entire 
population will occur. Again, these emphasize the role of homophily in enabling diffusion 
(infection) from a small seed: if there is some group of types that interacts within itself in 
a manner sufficient to enable diffusion among that group, then a toehold can be established 
and diffusion will occur from a small seed. 

Another corollary is that if populations are similar so that they have the same infection 
properties near (i.e., X{ = Xj = x for all i and j), then diffusion properties are determined 
by whether this growth rate is bigger or smaller than 1. 

COROLLARY 4 If Xi = Xj = x for all i and j, then there is diffusion from a small seed if 
and only if x > 1. 

This corollary then emphasizes that in order for the homophily and particular patterns 
of interaction to matter, it must be that types are not just heterogeneous in their interaction 
(the II matrix), but also in their adoption/infection proclivities. If they all have similar 
adoption/infection proclivities, then the particular details of who interacts with whom do 
not affect diffusion from a small seed. 

The proof of this corollary is straightforward. Note that 

( 7Tn ... TTim \ 

A = xll = x : ... : 

\ ^ml • • • 71 'mm / 

It follows that the largest eigenvalue of A is larger than 1 if and only if x > 1 since II is a 
stochastic matrix and has a maximum eigenvalue of 1. 

The less obvious cases are thus such that there are some types who would experience 
diffusion on their own, while others would not. Then the interaction patterns really matter 
and, as already illustrated for the two-type case, some subtle conditions ensue. A sufficient 
condition again is that there is sufficient homophily such that infection can take hold within 
some type, and then it can spread among the population, but more complicated patterns 
among a number of groups can also possibly lead to diffusion from a small seed. 



5 Concluding Remarks 

The focus of most of the related literature has been on analyzing the effect that the degree 
distribution has on diffusion in social networks (see e.g., Jackson and Rogers, 2007, Lopez- 
Pintado, 2008, Galeotti and Goyal, 2009, Galeotti et al., 2010.). This paper, however, 
focuses on the effect of homophily, something which despite its importance has received 
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little attention in the diffusion literature. One of the few exceptions is the paper by Golub 
and Jackson (2010) which also studies the impact of homophily on some (very different) 
learning and diffusion processes. There are important differences between our approach and 
theirs. On the one hand, the diffusion processes analyzed are not the same; we focus on 
what can be thought of as generalizations of the SIS infection model, whereas Golub and 
Jackson (2010) analyze models of diffusion based either on shortest paths communication, 
random walks or linear updating processes. Second, the paper by Golub and Jackson (2010) 
studies the convergence time to the steady state, whereas we analyze whether there is or not 
convergence to a state with a positive fraction of adopters. 

As a first step to understanding the effect of homophily on diffusion, in this paper we have 
concentrated on a specific question; namely the spreading of a new behavior when starting 
with a small initial seed. A central insight here is that homophily can facilitate infection or 
contagion. 

Nevertheless, there are other issues which are left for further work. For example, one 
could evaluate the size of the adoption endemic state as a function of the homophily level. 
There homophily might have conflicting effects: although it can facilitate an initial infection, 
it might be that an increase in homophily can also lead to a decrease in the overall infection 
rate. Indeed, the eventual fraction of adopters attained in the steady state might depend on 
the homophily level in complicated ways. 
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Appendix 

Proof of Theorem 1: The proof of Theorem 1 is a straightforward consequence of the 
proof of Theorem 2 as seen by substituting the functions fi(d,a) = u^a, gi(d,a) = 5i and 
Wi(d) = and obtaining the corresponding a^'s. | 

Proof of Theorem 3: First, note that the system of equations described describing the 
steady state is 

Pi = H i(pnP2,-,Pm)i (6) 

where 

H i (p 1 ,p 2 ,...,pJ = J2 *V £ P i ( rf K' ( d ) rate o^i + rate 1 ^ 

j d 3>d 3'd 

for % G {1, m). 
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This is approximated by a linear system in the neighborhood of . . . , p n ) = (0, 
as follows: 

p = Ap 

where 



0) 



/ 8Hi\ cWi I. \ 

/ d?>i \p=0 ■■■ dp m \P=0 \ 



A 



dH,, 



\ 9p x IP=° 



Note that 



drate^ 1 



a=0 



and therefore 

d ~ 4 io = / 4 (d, o) o (d - o)(i - o)^- 1 ) + f t (d, i) (?) (i - o)^- 1 ) = d [Md, i) + m 

Analogously 

drate}^ 

" i = d\g i (d,i)+9i(d,0)]. 



Then 



and thus, 



dHi 
dp. 



dpi 

\p=o = ir lj ^2P, j ( y d)w j ( y d) 
d 

( 



drate 



J-d 



\ rate)^\ - rate^ 1 ^ 



drate 1 ^} 
dpi 



A 



{rate^ + rate^ ) | 

7T 11 X 1 . . . 7l lm X m \ 
J 



where 



Pi(d)wi(d)d— 



fi(d,l)9i(d,0) - fi(d,0) gi (d,l) 



(mo)+ 9i (d,o)y 



Given Al, x; can be rewritten as 



Pi(d)wi(d)d— 



fi(d,l) 



9i(d,0) 



which is well defined since A4 holds. 

As mentioned in the text, A is primitive since II is primitive and since Al and A4 are satisfied 
implying that Xi > 0. 11 Thus, by the Perron-Frobenius Theorem (which applies to primitive 



ii 



In fact, with two types A is a positive matrix since < n < 1. 
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matrices) the maximum eigenvalue, denoted p hereafter, is positive and its corresponding 
eigenvector, denoted by u hereafter, is also positive. 

We show next that the condition for diffusion from a small seed, or the instability of 
p = 0, corresponds with the condition that the largest eigenvalue of A is larger than 1. 

Let us first show that if p > 1 then p = is unstable. Note that if p > 1 then 



Thus, picking small enough 5 so that 5ui < e for each i, satisfies the definition of diffusion 
from a small seed with 5u (or instability of 0). 

To see the converse, first consider the case such that p < 1. Given e > consider any v 
such that < Vi < e for all %. Suppose that Av > v. It then follows A(Av) > Av > v as 
A is nonnegative and has at least one positive entry in each row. Iterating, it follows that 
that A f v > v for any t. However, choose 6 such that Su > v. Given that A is is nonnegative 
and has at least one positive entry in each row, and both vectors are positive, it follows that 
A5u > Av, and similarly that 



given that p < 1, which is a contradiction. 

To complete this part of the proof consider the case such that p = 1. Consider e > 0. 
Consider any vector v such that v, < e. Note that for any small enough 8 > the largest 
eigenvalue of A — 51 is less than 1. Thus, by the argument above, (A — 5I)v is not greater 
than v. Therefore, Av is not greater than v. | 

Proof of Theorem 2: We have already shown that p = is unstable if and only if the 
largest eigenvalue of matrix A is above 1. Let us now complete the proof by examining the 
eigenvalue in the two-type case. The eigenvalues of a 2 x 2 matrix are easily computed. 
Writing 



A5u = p5u > 5u. 



A5u > AW. 



Given our previous claim, this then implies that 



A l 5\i > v 



for all t. However, 



Abu = 5p*u ->■ 




the largest eigenvalue of A is 



(an + a 22 ) + a/ (a n + a 22 ) 2 - 4(ana 2 2 - a 12 a 2 i) 



2 



or equivalently 



On + a 22 + a/(2 - an - a 22 ) 2 - 4 + 4a n + 4a 22 - Aa n a 2 2 + 4a i2 a 2 i 

2 



12 



Note that since A is primitive, its largest eigenvalue is real and positive. 
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Thus, fi is larger than 1 if and only if 

au + a 22 



>1 (7) 



or 

- 1 + an + a 22 - a u a 22 + a 12 a 2 i > 0. (8) 

Given that an = nxi, a 22 = ttx 2 , a 12 = (1 — tt)x 2 and a 21 = (1 — tt)xi then conditions (7) 
and (8) imply that diffusion (i.e., instability of p = 0) occurs if and only if 

2 

71 > 9 

Xi + x 2 

or 

tt(xi + x 2 — 2xiX 2 ) + X\X 2 — 1 > 0. (10) 
Case 1: f^j^ > 1- In this case, condition (10) is equivalent to 

1 - XiX 2 
7T > 

xi + x 2 - 2xix 2 

and therefore diffusion occurs in this case if and only if 

• r 1 — x l x 2 2 

7i > mm{ , }. (11) 

xi + x 2 — 2x\x 2 x\ + x 2 

Case 2: < 1. In this case, condition (10) is equivalent to 

XiX 2 — 1 

71 < 



2x\X 2 — X\ — x 2 
and therefore diffusion occurs in this case if and only if 



2 x x x 2 -\ 

< TX Or 7T < . (12) 



x\ + x 2 2x x x 2 —Xi—x 2 

Case 3: = 1- In this case, condition (10) simplifies to Xix 2 > 1, and and therefore 

diffusion occurs in this case if and only if 

o 

< 7r or X\X 2 > 1 (13) 



Xi + x 2 



Let us now show part (1) of Theorem 2. 

Suppose that X\X 2 > 1 holds. Then f^r—r 2 can fall into any of the cases above. If it were 
greater than 1, then Xi ^~^2x 1 x 2 < ® which in particular by Case 1 and (11) implies that 
there is diffusion for any tx G (0, 1). If it were equal to 1, then by Case 3, the result holds. 
If instead < 1 then Case 2 applies. In that case, referring to Figure 1, (xi,x 2 ) lies 

above the upper-most curve, 13 and it is clear that there would exist another profile (xi,x 2 ) 



13 Thc relative positions of the curves are easily checked, and note the plus and minus signs that indicate 
whether one is above or below 1 for the corresponding colored expression. 
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such that x\ < X\ and £2 < ^2 and which lies in the regions considered previously (that is, 
where Xl +~*-2x 1 x 2 — Therefore diffusion for (£i,£2) occurs for all 7r e (0,1), which in 
particular implies that for the larger case {x\, x?) diffusion would also occur for all tt G (0, 1) 
as the largest eigenvalue of a larger matrix is necessarily larger than the largest eigenvalue 
of a smaller matrix. 




Figure 1: The relationship between the key expressions in the proof of Theorem 2. 

Next, we show part (2) of Theorem 2. Suppose that X1X2 < 1. This implies that ^ + ^ 2 > 1 
(see Figure 1) or else that x\ = X2 = 1 in which Case 3 applies and there cannot be diffusion. 
Thus, let us analyze the situation where f^j^ > 1 and Case 1 applies. Diffusion occurs 
if and only if n > mini — }~ XlX2 — , — 7 — }. Note that if x 1 + x 2 < 2 then — 7 — > 1 and 

J l xi+x 2 -2x 1 x 2 ' x 1 +x 2 J 1 z x 1 +X2 

therefore diffusion occurs if and only if n > x ^~^-2x!X2 ' ^' 011 ^ e con t ra ry, X\ + X2 > 2 
then, it is straightforward to show that — 7 — > — }~ Xl % 2 — which also implies that diffusion 

' X1+X2 X1+X2— 2x\X2 1 

in such a case occurs if and only if it > — 1 ~ X1X2 — . | 

J X\+X 2 — lX\X2 

Proof of Lemma 1: Given the proof of Theorem 3, it follows that if Av > v for some 
v > then fi > 1. Then, choose 5 such that 5u < v. It follows that A5u < Av (since A is 
nonnegative and has at least one positive entry in each row), and similarly that 

//Su = A^u < A*v, 

and the first expression is growing with . | 
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